Using Noun Phrase Centrality to Identify Topics for Extraction based Summaries
نویسندگان
چکیده
In this paper, we use a Social Network Analysis method and decision tree analysis to study the distribution and relationship of Noun Phrases in documents and their corresponding abstracts. Initial results have shown significant improvement in extraction based text summarization by applying systematic predictions of the Noun Phrases that appear in both the documents and in their corresponding abstracts.
منابع مشابه
Keyword and Keyphrase Extraction Using Centrality Measures on Collocation Networks
Keyword and keyphrase extraction is an important problem in natural language processing, with applications ranging from summarization to semantic search to document clustering. Graph-based approaches to keyword and keyphrase extraction avoid the problem of acquiring a large in-domain training corpus by applying variants of PageRank algorithm on a network of words. Although graph-based approache...
متن کاملNoun Phrase Recognition with Tree Patterns
This paper offers a method for the noun phrase recognition of Hungarian natural language texts based on machine learning methods. The approach learns noun phrase tree patterns described by regular expressions from an annotated corpus. The tree patterns are completed with probability values using error statistics. The noun phrase recognition parser tries to find the best-fitting trees for a sent...
متن کاملWord Formation Approach to Noun Phrase Analysis for Thai
Noun phrase analysis is one of the most important components in Natural Language Processing (NLP) applications, such as information retrieval, extraction and categorization. For Thai, noun phrase analysis has unique problems, i.e., noun phrase boundary identification, noun phrase decomposition and its relation extraction, and core noun detection. Statistical and rule based Word formation is, th...
متن کاملCentrality Measures in Text Mining: Prediction of Noun Phrases that Appear in Abstracts
In this paper, we study different centrality measures being used in predicting noun phrases appearing in the abstracts of scientific articles. Our experimental results show that centrality measures improve the accuracy of the prediction in terms of both precision and recall. We also found that the method of constructing Noun Phrase Network significantly influences the accuracy when using the ce...
متن کاملDocument Retrieval and Routing Using the INQUERY System
The INQUERY retrieval and routing system, which is based on the Bayesian inference net retrieval model, has been described in a number of papers 5, 4, 10, 11]. In the TREC experiments this year, a number of new techniques were introduced for both the ad-hoc retrieval and routing runs. In addition, experiments with Spanish retrieval were carried out. For the ad-hoc retrieval experiments, the maj...
متن کامل